Abstract: This paper presents blind source separation (BSS) for additive mixing, where every recording consists of differently weighted source signals. Using independent component analysis (ICA) in both the time domain and the frequency domain, we separate the source signals from the mixed signals. The main aim of our analysis is to perform underdetermined convolutive BSS via frequency bin-wise clustering and permutation alignment, where the convolutive mixtures are delayed and weighted. In that setting ICA in the time domain fails to separate the signals. Hence, we instead use ICA in the frequency domain, which plays a vital role in the separation of audio signals; its implementation in MATLAB is our future work.
Keywords: Compressive Sensing; Sparsity; GPSR; K-means; L1-magic.

I. INTRODUCTION



A. Problem Description 

The cocktail party problem refers to the situation in which several sources are mixed together and it is difficult to determine what the original signals were. Blind source separation [1], also called blind signal separation, is the separation of a set of source signals from a set of mixed signals without the aid of information (or with very little information) about the source signals or the mixing process. Blind signal separation relies on the assumption that the source signals are not correlated with one another; for instance, the signals may be mutually statistically independent or decorrelated. Blind signal separation thus separates a set of signals into a set of other signals such that the regularity of each resulting signal is maximized and the regularity between the signals is minimized.

B. Preliminary Work 

Blind separation and blind deconvolution are related problems. As Fig. 1 shows, a set of sources $s_1(t), \ldots, s_N(t)$ (different people speaking, music, etc.) are mixed together by a matrix $A$. We have no knowledge of the sources or of the mixing process. All we receive is the $N$ mixed signals $x_1(t), \ldots, x_N(t)$. Our aim is to recover the original signals from the mixed signals by finding a square unmixing matrix $W$. In blind deconvolution [1], an unknown signal $s(t)$ is convolved with an unknown tapped delay-line filter $a_1, \ldots, a_L$, giving a corrupted signal $x(t) = a(t) * s(t)$, where $a(t)$ is the impulse response of the filter. The task is to recover $s(t)$ by convolving $x(t)$ with a learnt filter $w_1, \ldots, w_L$ that reverses the effect of the filter $a$.
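As a minimal sketch of the deconvolution idea, the following NumPy snippet convolves a toy signal with an assumed two-tap filter and undoes it with a truncated inverse filter. The filter values, the signal and the inverse-filter length are illustrative assumptions; here the inverse filter is computed analytically, whereas in the blind setting it would have to be learnt from x(t) alone.

```python
import numpy as np

# Toy source signal and an assumed two-tap delay-line filter (illustrative values).
s = np.array([1.0, -2.0, 3.0, 0.5, -1.0])
a = np.array([1.0, 0.5])

# Corrupted signal x(t) = a(t) * s(t) (linear convolution).
x = np.convolve(a, s)

# Truncated inverse of a: 1 / (1 + 0.5 z^-1) = sum_k (-0.5)^k z^-k.
n_taps = 20
w = (-0.5) ** np.arange(n_taps)

# Convolving x with w reverses the effect of a on the first len(s) samples.
s_hat = np.convolve(w, x)[: len(s)]
print(np.round(s_hat, 6))
```

The truncation error of the inverse filter only appears after `n_taps` samples, which is why the short toy signal is recovered essentially exactly.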

Put simply, in Fig. 1 the two sources "Hello" and "Morning" are each convolved with the respective impulse responses and added at both sensors (microphones). Finally, the original sources are recovered with the help of the unmixing matrix W.

Fig. 1 A diagrammatic representation of a mixing and an unmixing system
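The mixing and unmixing system can be sketched for the simpler instantaneous (non-convolutive) case as follows. The mixing matrix, the toy sources and the variable names are assumptions made for illustration; in particular, the unmixing matrix is obtained here by inverting the known A, which a blind method cannot do and must instead estimate from the mixtures.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two toy sources (rows) over 1000 samples; stand-ins for the two speakers.
s = rng.laplace(size=(2, 1000))

# Assumed mixing matrix A (unknown in the blind setting, fixed here for illustration).
A = np.array([[1.0, 0.6],
              [0.5, 1.0]])

# Sensor signals x(t) = A s(t).
x = A @ s

# With A known, the ideal unmixing matrix is W = A^{-1}; BSS must estimate it blindly.
W = np.linalg.inv(A)
y = W @ x

print(np.allclose(y, s))
```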



II. METHODOLOGY



Blind source separation (BSS) [1] has received considerable attention owing to its numerous application fields, such as array processing, speech recognition, communications, feature extraction, image processing, and biomedical analysis. Independent Component Analysis (ICA) [2], [3], [4] is one of the methods used for blind source separation. The goal of ICA [2], [3], [4] is to recover independent sources given sensor outputs in which the sources are linearly mixed. ICA not only decorrelates the signals but also reduces the higher-order statistical dependencies, trying to make the signals as independent as possible.

The general assumptions in BSS [1] regarding the sources are:

• The source signals are statistically independent and are stationary processes.

• At most one of the source signals is Gaussian distributed.

• It is only possible to recover the source signals up to scale and polarity.
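The scale and polarity ambiguity in the last assumption can be checked numerically. In this small sketch (toy sources and an assumed mixing matrix, both hypothetical), rescaling and sign-flipping the sources while compensating in the columns of the mixing matrix leaves the observations unchanged, so no method working from the mixtures alone can tell the two explanations apart.

```python
import numpy as np

rng = np.random.default_rng(1)
s = rng.laplace(size=(2, 500))          # toy sources
A = np.array([[1.0, 0.6],
              [0.5, 1.0]])              # assumed mixing matrix

# Rescale the sources (including a sign flip) and compensate in the columns of A.
D = np.diag([2.0, -0.5])
s_scaled = D @ s
A_scaled = A @ np.linalg.inv(D)

# Both (A, s) and (A D^{-1}, D s) explain exactly the same observations.
x1 = A @ s
x2 = A_scaled @ s_scaled
print(np.allclose(x1, x2))
```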

In signal processing, independent component analysis (ICA) [2], [3], [4] is a computational technique for separating a multivariate signal into additive subcomponents by assuming that the subcomponents are non-Gaussian signals and that they are all statistically independent of one another. ICA is a special case of blind source separation. BSS [1] based on independent component analysis (ICA) [2] is often classified into two groups according to the processing domain:

A. ICA in time-domain 

B. ICA in frequency-domain

ICA in time-domain [3] and ICA in frequency-domain [4], when combined, can achieve a superior separation of the sources under reverberant conditions, i.e., in an acoustic space. Generally speaking, however, conventional time-domain ICA fails to separate sources under heavily reverberant conditions because of the slow convergence of the iterative learning of the inverse of the mixing process. On the other hand, frequency-domain ICA also degrades significantly because the independence assumption for narrow-band signals collapses once the number of sub-bands increases. Our main aim is therefore to separate the signals from the mixed signals under reverberant conditions, i.e., with convolutive mixing [5], but before that we look at the algorithms for additive mixtures using time-domain ICA [3] and frequency-domain ICA [4].



III. ADDITIVE MIXING



A. ICA in time-domain 

Let us consider $m$ independent signals $s_1, \ldots, s_m$ summarized in a vector $s = (s_1, \ldots, s_m)^T$, where $T$ denotes transposition. The $m$ independent sources generate signals $s(t)$ at discrete times $t = 1, 2, \ldots$ Let us assume that we can observe only their $n$ linear mixtures, $x = (x_1, \ldots, x_n)^T$,

$$x(t) = A s(t) \tag{1}$$



Or, in component form,

$$x_i(t) = \sum_{k=1}^{m} A_{ik}\, s_k(t) \tag{2}$$



From the observed signals $x(1), \ldots, x(t)$, we would like to recover $s(1), \ldots, s(t)$ without knowing the matrix $A$. When $n = m$, the problem reduces to on-line estimation of $A$ or its inverse, $W$. We know that $x = As$ and $y = Wx$. Therefore,

$$y = WAs$$

so that $y$ recovers $s$ when $W = A^{-1}$.



To set up the cost function to be minimized [3], let us first consider a candidate $A$ of the mixing matrix in the overcomplete case and put

$$y = A^T x \tag{3}$$

so that, when the rows of $A$ are orthonormal ($A A^T = I$),

$$x = A y \tag{4}$$

Here, $y$ is an estimate of the original signal $s$. However, many vectors $y$ satisfy equation (4), and this $y$ does not necessarily give the original $s$ even when $A$ is the true mixing matrix.

Let us consider the probability density function $p(y, A)$ of $y$ determined by $A \in S_{m,n}$. Our target is to make the components of $y$ as independent as possible. Let us choose an independent distribution of $y$,

$$q(y) = \prod_{a} q_a(y_a) \tag{5}$$



Using the Kullback divergence [3] between the two distributions $p(y, A)$ and $q(y)$, we define the cost function

$$C(A) = KL[p(y, A) : q(y)] = \int p(y, A) \log \frac{p(y, A)}{q(y)}\, dy \tag{6}$$
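To make the role of the divergence in (6) concrete, here is a small discrete analogue. It is only a sketch under stated assumptions: the distributions are made-up 2x2 tables, and the independent distribution q is taken to be the product of the marginals of p (one possible choice, for which the divergence equals the mutual information). The function names are hypothetical.

```python
import numpy as np

def kl_divergence(p, q):
    """Discrete analogue of (6): sum of p * log(p / q) over all outcomes."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    mask = p > 0                      # terms with p = 0 contribute nothing
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

def product_of_marginals(p):
    """Independent distribution q(y1, y2) = q1(y1) q2(y2) built from the marginals of p."""
    return np.outer(p.sum(axis=1), p.sum(axis=0))

# Dependent joint distribution of two binary variables (illustrative numbers).
p_dep = np.array([[0.4, 0.1],
                  [0.1, 0.4]])
# Independent joint distribution with the same uniform marginals.
p_ind = np.array([[0.25, 0.25],
                  [0.25, 0.25]])

kl_dep = kl_divergence(p_dep, product_of_marginals(p_dep))  # positive: dependent components
kl_ind = kl_divergence(p_ind, product_of_marginals(p_ind))  # zero: independent components
print(kl_dep, kl_ind)
```

The divergence vanishes only when the joint distribution already factorizes, which is why driving it down pushes the components of y towards independence.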



This shows how far $p(y, A)$ is from the independent distribution $q(y)$; it is minimized when the components of $y = A^T x$ are independent, under a certain condition. The entropy term,

$$-H = \int p(y, A) \log p(y, A)\, dy \tag{7}$$

does not depend on $A$. Hence, minimizing (6) is equivalent to minimizing the following cost function,

$$C(A) = -E\Big[\sum_a \log q_a(y_a)\Big] - c \tag{8}$$

where $c = H$ is the entropy of $y$. The gradient [3] of

$$l(y, A) = -\sum_a \log q_a(y_a) \tag{9}$$

is calculated by

$$dl = \varphi(y)^T dy = \varphi(y)^T dA^T x \tag{10}$$

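where $\varphi(y)$ is called the activation function, chosen here for a super-Gaussian distribution because speech is super-Gaussian,

$$\varphi(y) = [\varphi_1(y_1), \ldots, \varphi_m(y_m)]^T, \qquad \varphi_i(y_i) = -\frac{d}{dy_i} \log q_i(y_i)$$

and $dy = dA^T x$ is used. We then have the ordinary gradient

$$\nabla l = x\, \varphi(y)^T = A y\, \varphi(y)^T \tag{11}$$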
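As an illustration of the activation function, two commonly used super-Gaussian source models give closed-form expressions when the negative derivative of the log-density is taken. These two densities are assumptions chosen for the example; the text does not state which density was used.

```latex
q_i(y_i) = \tfrac{1}{2}\, e^{-|y_i|}
  \;\Rightarrow\; \varphi_i(y_i) = -\frac{d}{dy_i}\bigl(-|y_i|\bigr) = \operatorname{sign}(y_i)
  \quad (y_i \neq 0),
\qquad
q_i(y_i) = \frac{1}{\pi \cosh(y_i)}
  \;\Rightarrow\; \varphi_i(y_i) = \frac{d}{dy_i} \log\cosh(y_i) = \tanh(y_i).
```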

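But the steepest descent direction of the cost function $C$ [3] is given by the natural gradient $\tilde{\nabla} l$, which is

$$\tilde{\nabla} l = \nabla l - A (\nabla l)^T A = A \big( y\, \varphi(y)^T - \varphi(y)\, y^T A^T A \big) \tag{12}$$

For the unmixing matrix $W$, with $y = Wx$, the gradient [3] is

$$\nabla l = \varphi(y)\, x^T \tag{13}$$

Therefore, the natural gradient [3] is

$$\tilde{\nabla} l = \nabla l - W (\nabla l)^T W = \varphi(y)\, x^T - y\, \varphi(y)^T W \tag{14}$$

The learning rule [3] is

$$\Delta W_t = -\eta_t \tilde{\nabla} l = -\eta_t \big\{ \varphi(y_t)\, x_t^T - y_t\, \varphi(y_t)^T W_t \big\} \tag{15}$$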



In this way, when $n = m$ and no prewhitening takes place, we get the natural gradient [3] formula

$$\tilde{\nabla} l = (I - \varphi(y)\, y^T) W \tag{16}$$

This is the algorithm that we followed.
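A learning rule built on (16) might be sketched as below. This is a minimal batch version under several assumptions that are not taken from the paper: the widely used update W <- W + eta (I - E[phi(y) y^T]) W, the activation phi = tanh, a fixed step size, Laplacian toy sources instead of speech, and a hand-picked mixing matrix. The function name is hypothetical.

```python
import numpy as np

def natural_gradient_ica(x, eta=0.1, n_iter=1000):
    """Batch natural-gradient ICA for n = m without prewhitening.

    Update: W <- W + eta * (I - E[phi(y) y^T]) W with phi = tanh,
    an assumed activation suited to super-Gaussian sources.
    """
    n, n_samples = x.shape
    W = np.eye(n)
    for _ in range(n_iter):
        y = W @ x
        phi_yT = np.tanh(y) @ y.T / n_samples   # sample estimate of E[phi(y) y^T]
        W = W + eta * (np.eye(n) - phi_yT) @ W
    return W

rng = np.random.default_rng(0)
s = rng.laplace(size=(2, 5000))            # toy super-Gaussian sources, not speech
A = np.array([[1.0, 0.6],
              [0.5, 1.0]])                 # assumed mixing matrix
x = A @ s                                  # additive (instantaneous) mixtures

W = natural_gradient_ica(x)
G = W @ A                                  # global system; ideally a scaled permutation
print(np.round(G, 3))
```

If the separation succeeds, each row of the global matrix `G` has one dominant entry, which reflects the scale ambiguity noted in the assumptions: the sources are recovered only up to a scale factor.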
Result 

Finally, we were able to separate the original signals from the mixed signals. To implement this we used two source signals, i.e., male 1 and male 2.

1. Original signals:

Fig. A.1 Original male 1

Fig. A.2 Original male 2

2. Separated signals:

Fig. A.3 Separated male 1

Fig. A.4 Separated male 2



B. ICA in frequency-domain 

To overcome the problems encountered with ICA in the time domain, we used ICA in the frequency domain [4]. Convolution in the time domain corresponds to multiplication in the frequency domain. Therefore,

$$X_i(\omega) = \sum_{k} A_{ik}(\omega)\, S_k(\omega) \tag{17}$$

where $X_i(\omega)$, $A_{ik}(\omega)$ and $S_k(\omega)$ are the Fourier transforms of $x_i(t)$, the impulse response $h_{ik}(t)$ and $s_k(t)$, respectively.
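The correspondence between time-domain convolution and frequency-domain multiplication can be verified numerically for a single source and a single sensor. The impulse response and the random test signal below are assumptions for illustration; both sequences are zero-padded to the full output length so that the FFT product equals the linear convolution.

```python
import numpy as np

rng = np.random.default_rng(0)
s = rng.standard_normal(64)                # toy source segment
h = np.array([1.0, 0.5, 0.25])             # assumed short impulse response

# Time domain: linear convolution x(t) = (h * s)(t).
x_time = np.convolve(h, s)

# Frequency domain: zero-pad both to the full output length, multiply the spectra, invert.
n = len(s) + len(h) - 1
X = np.fft.fft(h, n) * np.fft.fft(s, n)
x_freq = np.fft.ifft(X).real

print(np.allclose(x_time, x_freq))
```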

Here we divide the signals into a number of frames of, say, 40 ms. The difficulty with this approach is the ambiguities [4] of permutation and amplitude. To resolve this drawback we use an envelope detector. We therefore adopt a technique based on the windowed Fourier transform [4], i.e., the spectrogram [4].

A spectrogram [4] is a visual representation of the spectrum of frequencies as it varies with time or another variable. In the graph, the horizontal axis is time, the vertical axis is frequency, and a third dimension indicates the amplitude of a particular frequency at a particular time.




Fig. B.1 The windowed-Fourier transform

If the delays and reflections are not too long, we can treat the mixture at each frequency as instantaneous rather than convolutive. However, a new drawback then occurs, namely the ambiguities [4] of amplitude and permutation. We therefore have to remove these ambiguities in order to reconstruct the signals. Our plan is to use the inverses of the decorrelating matrices and the envelope of the speech signal. This is possible because of the temporal structure of the acoustic signal: it is stationary over a short period but not stationary over the long term.

Therefore, following the windowed Fourier transform [4],

$$X(\omega, t_s) = A(\omega)\, S(\omega, t_s) \tag{18}$$

where $X(\omega, t_s)$ and $S(\omega, t_s)$ are the windowed Fourier transforms [4] of $x(t)$ and $s(t)$. The windowed Fourier transform is defined as

$$X(\omega, t_s) = \sum_{t} e^{-j\omega t}\, x(t)\, w(t - t_s) \tag{19}$$

where $\omega \in \{0, 2\pi/N, \ldots, 2\pi(N-1)/N\}$ is the frequency, $N$ is the number of points of the discrete Fourier transform, $w$ is the window function, and $t_s$ is the window position.
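A windowed Fourier transform in the spirit of (19) can be sketched with NumPy as follows. The Hann window, the 8 kHz sampling rate, the 50 % overlap and the 1 kHz test tone are all assumptions for illustration, and the function name is hypothetical. Each frame's DFT is taken relative to the frame start, so the result differs from (19) only by a phase factor per frame.

```python
import numpy as np

def windowed_fourier_transform(x, frame_len, hop):
    """Return X[k, i]: DFT bin k of the frame starting at window position t_s = i * hop.

    A Hann window is assumed here; the paper does not state which window was used.
    """
    w = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop : i * hop + frame_len] * w for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1).T   # shape: (frame_len // 2 + 1, n_frames)

fs = 8000                                  # assumed sampling rate in Hz
frame_len = int(round(0.040 * fs))         # 40 ms frames -> 320 samples
hop = frame_len // 2                       # assumed 50 % overlap

t = np.arange(fs) / fs                     # one second of signal
x = np.sin(2 * np.pi * 1000 * t)           # 1 kHz test tone instead of speech

X = windowed_fourier_transform(x, frame_len, hop)
peak_bin = int(np.abs(X[:, 0]).argmax())
print(X.shape, peak_bin * fs / frame_len)  # frequency of the strongest bin in Hz
```

Plotting the magnitude of `X` with time on the horizontal axis and frequency on the vertical axis gives the spectrogram described above.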

Let us write $X(\omega, t_s)$ and $S(\omega, t_s)$ for a fixed frequency $\omega$ as $X_\omega(t_s)$ and $S_\omega(t_s)$. Equation (18) becomes

$$X_\omega(t_s) = A(\omega)\, S_\omega(t_s) \tag{20}$$

This shows that a convolutive mixture is an instantaneous mixture at each fixed frequency $\omega$. Therefore, we can apply any ICA algorithm at each frequency and separate the signals, which gives a separated time sequence for each frequency,

$$u_\omega(t_s) = B(\omega)\, X_\omega(t_s) \tag{21}$$

By aligning these sequences across all frequencies and applying the inverse Fourier transform, we can reconstruct the separated signals. However, ICA [4] by itself cannot resolve the ambiguities of amplitude and permutation, so we now describe how to solve these two problems.

• The amplitude ambiguity is solved by projecting the separated independent components back to the sensor inputs with the inverse matrices $B(\omega)^{-1}$.

• Because the independent components of the same source are similar across many frequencies, the components are classified by this similarity, which resolves the permutation.
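The two fixes listed above can be illustrated for a single frequency bin with two sources. This is a toy sketch, not the procedure of [4]: the envelopes are synthetic, the reference envelope used for alignment is assumed to be known (in practice it would be built from other, already aligned bins), similarity is measured by plain correlation of magnitudes, and both function names are hypothetical.

```python
import numpy as np

def project_back(B, u, sensor=0):
    """Amplitude fix: image of each component at one sensor, v_i = [B^{-1}]_{sensor,i} u_i."""
    B_inv = np.linalg.inv(B)
    return B_inv[sensor, :, None] * u      # shape (n_components, n_frames)

def align_permutation(env, ref):
    """Two-source permutation fix: swap rows of env if that correlates better with ref."""
    def score(e):
        return sum(np.corrcoef(e[i], ref[i])[0, 1] for i in range(2))
    return env if score(env) >= score(env[::-1]) else env[::-1]

rng = np.random.default_rng(0)
n_frames = 200
half = n_frames // 2
# Toy envelopes: source 0 is active in the first half, source 1 in the second half.
ref = np.vstack([np.r_[np.ones(half), 0.1 * np.ones(half)],
                 np.r_[0.1 * np.ones(half), np.ones(half)]])
s = ref * rng.standard_normal((2, n_frames))   # components at one frequency bin

# Amplitude ambiguity: B and D @ B separate equally well but scale the outputs differently.
A = np.array([[1.0, 0.6], [0.5, 1.0]])         # assumed mixing matrix at this bin
x = A @ s
B = np.linalg.inv(A)
D = np.diag([3.0, -0.2])
v1 = project_back(B, B @ x)
v2 = project_back(D @ B, (D @ B) @ x)
print(np.allclose(v1, v2))                     # projection back removes the arbitrary scaling

# Permutation ambiguity: a bin whose outputs came out swapped is re-ordered by its envelopes.
swapped = np.abs(s)[::-1]
aligned = align_permutation(swapped, ref)
print(np.allclose(aligned, np.abs(s)))
```

Projecting back works because any row scaling of the separating matrix is cancelled by the matching column of its inverse, so the result is expressed in the units of the chosen sensor.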



International Journal of Scientific Engineering and Technology (IJSET), 2015, Volume No. 4, Issue No. 4, pp. 264-267
Result 

Here, we successfully separated the signals from the mixed signals.

1. Mixed signals:

Fig. B.2 Mixed signal 1

Fig. B.3 Mixed signal 2

2. Separated signals:

Fig. B.4 Separated male 1 signal

Fig. B.5 Separated male 2 signal



IV. RESULT ANALYSIS



Since ICA in the time domain fails to separate the signals under reverberant conditions, where the signals are delayed and weighted, we are going to use ICA in the frequency domain [5].

The BSS technique is applied to an underdetermined [5] case, in which the number of microphones is less than the number of sources. The separation is performed in the frequency domain and comprises two stages.

• In the first stage, the frequency-domain mixture samples are clustered into each source by an expectation-maximization (EM) algorithm [5].

• Since the clustering is performed in a frequency bin-wise manner, the permutation ambiguities [5] of these bin-wise clustered samples must be aligned. This is solved in the second stage by using the probability of how likely each sample is to belong to the assigned class.



To separate such underdetermined mixtures, we use time-frequency (T-F) masking [5]. The EM algorithm [5] is an iterative method for finding maximum likelihood or maximum a posteriori estimates of parameters in statistical models.
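The combination of EM clustering and masking can be illustrated with a deliberately simplified toy: EM for a two-component one-dimensional Gaussian mixture, whose posterior probabilities serve as a soft mask. This is a stand-in under stated assumptions, not the model of [5]: the scalar feature, its values, the number of components and the function name are all made up for the example.

```python
import numpy as np

def em_two_gaussians(z, n_iter=50):
    """EM for a two-component 1-D Gaussian mixture; returns means and posteriors."""
    mu = np.array([z.min(), z.max()])       # simple deterministic initialisation
    var = np.array([z.var(), z.var()])
    pi = np.array([0.5, 0.5])
    for _ in range(n_iter):
        # E-step: posterior probability that each sample belongs to each class.
        lik = pi * np.exp(-(z[:, None] - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)
        post = lik / lik.sum(axis=1, keepdims=True)
        # M-step: re-estimate weights, means and variances from the posteriors.
        nk = post.sum(axis=0)
        pi = nk / len(z)
        mu = (post * z[:, None]).sum(axis=0) / nk
        var = (post * (z[:, None] - mu) ** 2).sum(axis=0) / nk
    return mu, post

rng = np.random.default_rng(0)
# Toy 1-D feature per time-frequency sample; the values are made up for illustration.
z = np.concatenate([rng.normal(-2.0, 0.5, 300), rng.normal(2.0, 0.5, 300)])

mu, post = em_two_gaussians(z)
mask_source0 = post[:, 0]                   # soft T-F mask for the first cluster
print(np.round(mu, 2))
```

Multiplying the mixture's time-frequency samples by such a mask keeps the samples attributed to one source and attenuates the rest; the same posteriors are the "how likely each sample belongs to the assigned class" quantities used for alignment in the second stage.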

The advantage of this two-stage structure is that the clustering stage improves the separation of the signals from the mixtures compared with methods based on the time difference of arrival (TDOA). In addition, this permutation alignment technique performs better than a traditional technique based on amplitude envelopes.

CONCLUSION 

Thus, we obtained the separated signals from the mixed signals by applying the ICA method in the time domain and the frequency domain for the case of additive mixing. We also resolved the ambiguities of amplitude and permutation in the frequency domain. Our future study will address underdetermined convolutive blind source separation using ICA in the frequency domain with frequency bin-wise clustering and permutation alignment, which improves the separation performance.